智能论文笔记

Differentiable Rendering for Pose Estimation in Proximity Operations

Ramchander Rao Bhaskara , Roshan Thomas Eapen , Manoranjan Majji

分类：计算机视觉 | 机器人

2022-12-24

Differentiable rendering aims to compute the derivative of the image rendering function with respect to the rendering parameters. This paper presents a novel algorithm for 6-DoF pose estimation through gradient-based optimization using a differentiable rendering pipeline. We emphasize two key contributions: (1) instead of solving the conventional 2D to 3D correspondence problem and computing reprojection errors, images (rendered using the 3D model) are compared only in the 2D feature space via sparse 2D feature correspondences. (2) Instead of an analytical image formation model, we compute an approximate local gradient of the rendering process through online learning. The learning data consists of image features extracted from multi-viewpoint renders at small perturbations in the pose neighborhood. The gradients are propagated through the rendering pipeline for the 6-DoF pose estimation using nonlinear least squares. This gradient-based optimization regresses directly upon the pose parameters by aligning the 3D model to reproduce a reference image shape. Using representative experiments, we demonstrate the application of our approach to pose estimation in proximity operations.

translated by 谷歌翻译

An FPGA framework for Interferometric Vision-Based Navigation (iVisNav)

Ramchander Rao Bhaskara , Kookjin Sung , Manoranjan Majji

分类：机器人

2022-08-07

基于干涉视觉的导航（IVISNAV）是一种用于自主接近操作的新型光电传感器。 ivisnav采用激光发射结构化的信标，并通过测量传输激光脉冲的相变的变化来精确地表征六个自由度相对运动速率。 Ivisnav的嵌入式软件包必须有效地处理高频动力学，以进行健壮的感应和估计。本文开发了一种新的嵌入式系统，用于基于最小二乘的速率估计。所得系统能够与光子学连接并在现场可编程的门数阵列中实现估计算法。嵌入式软件包被证明是使用有限的精度算术进行高速计算的硬件/软件共同设计估计程序。将有限精度FPGA硬件设计的准确性与MATLAB上算法的浮点软件评估进行了比较，以基于其性能和与错误度量的统计一致性。实施结果证明了使用IVISNAV进行高速接近导航的FPGA计算功能的实用性。

translated by 谷歌翻译

A novel state connection strategy for quantum computing to represent and compress digital images

Md Ershadul Haque , Manoranjan Paul , Tanmoy Debnath

分类：计算机视觉

2022-12-14

Quantum image processing draws a lot of attention due to faster data computation and storage compared to classical data processing systems. Converting classical image data into the quantum domain and state label preparation complexity is still a challenging issue. The existing techniques normally connect the pixel values and the state position directly. Recently, the EFRQI (efficient flexible representation of the quantum image) approach uses an auxiliary qubit that connects the pixel-representing qubits to the state position qubits via Toffoli gates to reduce state connection. Due to the twice use of Toffoli gates for each pixel connection still it requires a significant number of bits to connect each pixel value. In this paper, we propose a new SCMFRQI (state connection modification FRQI) approach for further reducing the required bits by modifying the state connection using a reset gate rather than repeating the use of the same Toffoli gate connection as a reset gate. Moreover, unlike other existing methods, we compress images using block-level for further reduction of required qubits. The experimental results confirm that the proposed method outperforms the existing methods in terms of both image representation and compression points of view.

translated by 谷歌翻译

Analysis and prediction of heart stroke from ejection fraction and serum creatinine using LSTM deep learning approach

Md Ershadul Haque , Salah Uddin , Md Ariful Islam , Amira Khanom , Abdulla Suman , Manoranjan Paul

分类：计算机视觉 | 机器学习

2022-09-28

大数据和深度学习的结合是一项破坏世界的技术，如果正确使用，可以极大地影响任何目标。随着深度学习技术中大量医疗保健数据集和进步的可用性，系统现在可以很好地预测任何健康问题的未来趋势。从文献调查中，我们发现SVM用于预测心力衰竭的情况，而无需关联客观因素。利用电子健康记录（EHR）中重要历史信息的强度，我们利用长期记忆（LSTM）建立了一个智能和预测的模型，并根据该健康记录预测心力衰竭的未来趋势。因此，这项工作的基本承诺是使用基于患者的电子药用信息的LSTM来预测心脏的失败。我们已经分析了一个数据集，该数据集包含在Faisalabad心脏病学研究所和Faisalabad（巴基斯坦旁遮普邦）的盟军医院收集的299例心力衰竭患者的病历。这些患者由105名女性和194名男性组成，年龄在40岁和95岁之间。该数据集包含13个功能，这些功能报告了负责心力衰竭的临床，身体和生活方式信息。我们发现我们的分析趋势越来越多，这将有助于促进心中预测领域的知识。

translated by 谷歌翻译

Rice Leaf Disease Classification and Detection Using YOLOv5

Md Ershadul Haque , Ashikur Rahman , Iftekhar Junaeid , Samiul Ul Hoque , Manoranjan Paul

分类：计算机视觉 | 人工智能

2022-09-04

全球一百多个国家的主食是大米（Oryza sativa）。大米的种植对于全球经济增长至关重要。但是，农业产业面临的主要问题是水稻疾病。农作物的质量和数量下降了，这是主要原因。由于任何国家的农民对水稻疾病都没有太多了解，因此他们无法正确诊断稻叶疾病。这就是为什么他们不能适当照顾米叶的原因。结果，生产正在减少。从文献调查中，Yolov5表现出更好的结果与其他深度学习方法相比。由于对象检测技术的不断发展，Yolo家族算法具有非常高的精度和更好的速度，已在各种场景识别任务中使用，以构建稻叶疾病监测系统。我们已经注释了1500个收集的数据集，并提出了基于Yolov5深学习的水稻疾病分类和检测方法。然后，我们训练并评估了Yolov5模型。模拟结果显示了本文提出的增强Yolov5网络的对象检测结果的改进。所需的识别精度，召回，MAP值和F1得分的水平分别为90 \％，67 \％，76 \％和81 \％\％被视为性能指标。

translated by 谷歌翻译

Efficient Motion Modelling with Variable-sized blocks from Hierarchical Cuboidal Partitioning

Priyabrata Karmakar , Manzur Murshed , Manoranjan Paul , David Taubman

分类：计算机视觉

2022-08-28

具有基于块体系结构的运动建模已被广泛用于视频编码中，其中框架分为固定尺寸的块，这些块是独立补偿的。这通常会导致编码效率低下，因为固定尺寸的块几乎与对象边界不符。尽管已经引入了层次结构分区来解决这一问题，但运动矢量的增加限制了收益。最近，与立方体分配的图像的近似分割已经普及。可变大小的矩形片段（立方体）不仅容易适应基于块的图像/视频编码技术，而且还可以很好地与对象边界保持一致。这是因为立方分区基于同质性约束，从而最大程度地减少了平方误差的总和（SSE）。在本文中，我们研究了针对可扩展视频编码中使用的固定尺寸块的运动模型的潜力。具体而言，我们使用图片组（GOP）中的锚框的立方分区信息构建了运动补偿帧。然后，预测的当前帧已用作基础层，同时使用可扩展的HEVC编码器编码当前帧作为增强层。实验结果确认4K视频序列上节省了6.71％-10.90％的比特率。

translated by 谷歌翻译

Efficient dynamic point cloud coding using Slice-Wise Segmentation

Faranak Tohidi , Manoranjan Paul , Anwaar Ulhaq

分类：计算机视觉

2022-08-17

随着沉浸式视频序列的快速增长，实现无缝和高质量的压缩3D含量更为关键。 MPEG最近开发了一种基于视频的点云压缩（V-PCC），用于动态点云编码。但是，使用V-PCC进行重建的点云会遭受不同的工件的影响，包括在应用现有视频编码技术之前在预处理过程中丢失数据，例如高效视频编码（HEVC）。贴片世代和2D投影中3D的自封点是使用V-PCC丢失数据的主要原因。本文提出了一种新方法，将重叠切片作为贴片生成的替代方法，以减少生成的贴片数量和丢失的数据量。在提出的方法中，整个点云已根据自锁定点的数量将整个点云分为横截面，以便在斑块生成过程和投影中可以最大程度地减少数据丢失。为此，考虑了可变数量的层，部分重叠以保留自锁定点。所提出的方法的额外优势是减少位置的需求并使用切片底座编码几何数据。实验结果表明，与标准的V-PCC方法相比，提出的方法比标准V-PCC方法更灵活，改善了率延伸性能，并且与标准V-PCC方法相比，数据丢失显着降低。

translated by 谷歌翻译